Integrated cTAKES for Concept Mention Detection and Normalization

نویسندگان

Hongfang Liu

Kavishwar B. Wagholikar

Siddhartha Jonnalagadda

Sunghwan Sohn

چکیده

We participated Task 1 using an existing system MedTagger implemented in integrated cTAKES (icTAKES). The concept mention detection is based on Conditional Random Fields (CRF) and the concept mention normalization is based on a greedy dictionary lookup algorithm. A distinctive feature in MedTagger compared to other concept mention detection systems is the incorporation of dictionary lookup results into a machine learning framework for sequential labeling. Dictionary lookup results of MedLex and semantic vectors representing distributed semantics were used as features. Overall, the precision, recall, and F-measure of our best run for concept mention are 0.8, 0.573, and 0.668 respectively for strict evaluation and 0.939, 0.766, and 0.844 for relaxed evaluation. The accuracy of our best run for concept mention normalization is 54.6% and 87.0% for strict and relaxed mapping, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

QuickUMLS: a fast, unsupervised approach for medical concept extraction

Entity extraction is a fundamental step in many health informatics systems. In recent years, tools such as MetaMap and cTAKES have been widely used for medical concept extraction on medical literature and clinical notes; however, relatively little interest has been placed on their scalability to large datasets. In this work, we present QuickUMLS: a fast, unsupervised, approximate dictionary mat...

متن کامل

TeamUEvora at Clef eHealth 2014 Task2a

We present our first participation in a ShARe/CLEF eHealth Lab contributing for task 2a. Task 2 is an extension of the 2013 lab task 1 and consists of information extraction from clinical texts for Disease/Disorder Template Filling; task 2a aims at predicting each attribute’s normalization value. This work constitutes a preliminary approach to the problem of extracting and handling information ...

متن کامل

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications

We aim to build and evaluate an open-source natural language processing system for information extraction from electronic medical record clinical free-text. We describe and evaluate our system, the clinical Text Analysis and Knowledge Extraction System (cTAKES), released open-source at http://www.ohnlp.org. The cTAKES builds on existing open-source technologies-the Unstructured Information Mana...

متن کامل

Medical Disorder Recognition with Structural Support Vector Machines

In this paper we present two systems that address the issues of disorder recognition and normalization submitted by the authors as defined by the CLEF/ShARe Evaluation Lab. The first approach to the tasks formed a baseline approach using the cTakes system. Our second approach leveraged Structural Support Vector Machines with an array of feature types including lexical, semantic and cluster base...

متن کامل

CUAB: Supervised Learning of Disorders and their Attributes using Relations

We implemented an end-to-end system for disorder identification and slot filling. For identifying spans for both disorders and their attributes, we used a linear chain conditional random field (CRF) approach coupled with cTAKES for pre-processing. For combining disjoint disorder spans, finding relations between attributes and disorders, and attribute normalization, we used l2-regularized l2-los...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Integrated cTAKES for Concept Mention Detection and Normalization

نویسندگان

چکیده

منابع مشابه

QuickUMLS: a fast, unsupervised approach for medical concept extraction

TeamUEvora at Clef eHealth 2014 Task2a

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications

Medical Disorder Recognition with Structural Support Vector Machines

CUAB: Supervised Learning of Disorders and their Attributes using Relations

عنوان ژورنال:

اشتراک گذاری